智能论文笔记

Refining Control Barrier Functions through Hamilton-Jacobi Reachability

Sander Tonkens , Sylvia Herbert

分类：机器人

2022-04-26

基于控制屏障功能（CBF）的安全过滤器已成为自治系统安全至关重要控制的实用工具。这些方法通过价值函数编码安全性，并通过对该值函数的时间导数施加限制来执行安全。但是，在存在输入限制的情况下合成并非过于保守的有效CBF是一个臭名昭著的挑战。在这项工作中，我们建议使用正式验证方法提炼候选CBF，以获得有效的CBF。特别是，我们使用基于动态编程（DP）的可及性分析更新专家合成或备份CBF。我们的框架RefineCBF保证，在每次DP迭代中，获得的CBF至少与先前的迭代一样安全，并收集到有效的CBF。因此，RefineCBF可用于机器人系统。我们证明了我们在模拟中使用各种CBF合成技术来增强安全性和/或降低一系列非线性控制型系统系统的保守性的实用性。

translated by 谷歌翻译

Safe Autonomous Racing via Approximate Reachability on Ego-vision

Bingqing Chen , Jonathan Francis , Jean Oh , Eric Nyberg , Sylvia L. Herbert

分类：机器人 | 人工智能 | 机器学习

2021-10-14

当任何安全违规可能导致灾难性失败时，赛车要求每个车辆都能在其物质范围内驾驶。在这项工作中，我们研究了自主赛车的安全强化学习（RL）的问题，使用车辆的自我摄像机视图和速度作为输入。鉴于任务的性质，自主代理需要能够1）识别并避免复杂的车辆动态下的不安全场景，而2）在快速变化的环境中使子第二决定。为了满足这些标准，我们建议纳入汉密尔顿 - 雅各（HJ）可达性理论，是一般非线性系统的安全验证方法，进入受约束的马尔可夫决策过程（CMDP）框架。 HJ可达性不仅提供了一种了解安全的控制理论方法，还可以实现低延迟安全验证。尽管HJ可达性传统上不可扩展到高维系统，但我们证明了具有神经逼近的，可以直接在视觉上下文中学习HJ安全值 - 迄今为止通过该方法研究的最高尺寸问题。我们在最近发布的高保真自主赛车环境中评估了我们在几个基准任务中的方法，包括安全健身房和学习（L2R）。与安全健身房的其他受约束的RL基线相比，我们的方法非常少的限制性违规，并在L2R基准任务上实现了新的最先进结果。我们在以下匿名纸质网站提供额外可视化代理行为：https://sites.google.com/view/safeautomouracing/home

translated by 谷歌翻译

High-resolution canopy height map in the Landes forest (France) based on GEDI, Sentinel-1, and Sentinel-2 data with a deep learning approach

Martin Schwartz , Philippe Ciais , Catherine Ottlé , Aurelien De Truchis , Cedric Vega , Ibrahim Fayad , Martin Brandt , Rasmus Fensholt , Nicolas Baghdadi , François Morneau

分类：计算机视觉

2022-12-20

In intensively managed forests in Europe, where forests are divided into stands of small size and may show heterogeneity within stands, a high spatial resolution (10 - 20 meters) is arguably needed to capture the differences in canopy height. In this work, we developed a deep learning model based on multi-stream remote sensing measurements to create a high-resolution canopy height map over the "Landes de Gascogne" forest in France, a large maritime pine plantation of 13,000 km$^2$ with flat terrain and intensive management. This area is characterized by even-aged and mono-specific stands, of a typical length of a few hundred meters, harvested every 35 to 50 years. Our deep learning U-Net model uses multi-band images from Sentinel-1 and Sentinel-2 with composite time averages as input to predict tree height derived from GEDI waveforms. The evaluation is performed with external validation data from forest inventory plots and a stereo 3D reconstruction model based on Skysat imagery available at specific locations. We trained seven different U-net models based on a combination of Sentinel-1 and Sentinel-2 bands to evaluate the importance of each instrument in the dominant height retrieval. The model outputs allow us to generate a 10 m resolution canopy height map of the whole "Landes de Gascogne" forest area for 2020 with a mean absolute error of 2.02 m on the Test dataset. The best predictions were obtained using all available satellite layers from Sentinel-1 and Sentinel-2 but using only one satellite source also provided good predictions. For all validation datasets in coniferous forests, our model showed better metrics than previous canopy height models available in the same region.

translated by 谷歌翻译

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Harry Coppock , George Nicholson , Ivan Kiskin , Vasiliki Koutra , Kieran Baker , Jobie Budd , Richard Payne , Emma Karoune , David Hurley , Alexander Titcomb

分类：机器学习

2022-12-15

Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata, including reverse transcription polymerase chain reaction (PCR) test outcomes, of whom 23,514 tested positive for SARS CoV 2. Subjects were recruited via the UK governments National Health Service Test-and-Trace programme and the REal-time Assessment of Community Transmission (REACT) randomised surveillance survey. In an unadjusted analysis of our dataset AI classifiers predict SARS-CoV-2 infection status with high accuracy (Receiver Operating Characteristic Area Under the Curve (ROCAUC) 0.846 [0.838, 0.854]) consistent with the findings of previous studies. However, after matching on measured confounders, such as age, gender, and self reported symptoms, our classifiers performance is much weaker (ROC-AUC 0.619 [0.594, 0.644]). Upon quantifying the utility of audio based classifiers in practical settings, we find them to be outperformed by simple predictive scores based on user reported symptoms.

translated by 谷歌翻译

A large-scale and PCR-referenced vocal audio dataset for COVID-19

Jobie Budd , Kieran Baker , Emma Karoune , Harry Coppock , Selina Patel , Ana Tendero Cañadas , Alexander Titcomb , Richard Payne , David Hurley , Sabrina Egglestone

分类：机器学习

2022-12-15

The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmission of the Alpha and Delta SARS-CoV-2 variants and some Omicron variant sublineages. Audio recordings of volitional coughs, exhalations, and speech were collected in the 'Speak up to help beat coronavirus' digital survey alongside demographic, self-reported symptom and respiratory condition data, and linked to SARS-CoV-2 test results. The UK COVID-19 Vocal Audio Dataset represents the largest collection of SARS-CoV-2 PCR-referenced audio recordings to date. PCR results were linked to 70,794 of 72,999 participants and 24,155 of 25,776 positive cases. Respiratory symptoms were reported by 45.62% of participants. This dataset has additional potential uses for bioacoustics research, with 11.30% participants reporting asthma, and 27.20% with linked influenza PCR test results.

translated by 谷歌翻译

The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles

Injy Hamed , Alia El Bolock , Cornelia Herbert , Slim Abdennadher , Ngoc Thang Vu

分类：自然语言处理

2022-07-31

代码转换（CS）是多语言个体所表现出的常见语言现象，在一次对话中，它们倾向于在语言之间交替。 CS是一种复杂的现象，不仅包含语言挑战，而且还包含大量的复杂性，就其在说话者之间的动态行为而言。鉴于产生CS的因素因一个国家而异，并且从一个人到另一个人都不同，因此发现CS是一种依赖说话者的行为，在该行为中，外语被嵌入的频率在说话者之间有所不同。尽管几位研究人员从语言的角度研究了CS行为，但研究仍然缺乏从社会学和心理学角度预测用户CS行为的任务。我们提供了一项经验用户研究，我们研究用户的CS级别和性质特征之间的相关性。我们对双语者进行访谈，并收集有关他们的个人资料的信息，包括他们的人口统计学，个性特征和旅行经验。然后，我们使用机器学习（ML）根据其配置文件来预测用户的CS级别，在此我们确定建模过程中的主要影响因素。我们试验分类和回归任务。我们的结果表明，CS行为受到说话者之间的关系，旅行经验以及神经质和外向性人格特征的影响。

translated by 谷歌翻译

Applying data technologies to combat AMR: current status, challenges, and opportunities on the way forward

Leonid Chindelevitch , Elita Jauneikaite , Nicole E. Wheeler , Kasim Allel , Bede Yaw Ansiri-Asafoakaa , Wireko A. Awuah , Denis C. Bauer , Stephan Beisken , Kara Fan , Gary Grant

分类：人工智能 | 机器学习

2022-07-05

抗微生物抗性（AMR）是日益增长的公共卫生威胁，估计每年造成超过1000万人死亡，在现状预测下，到2050年，全球经济损失了100万亿美元。这些损失主要是由于治疗失败的发病率和死亡率增加，医疗程序中的AMR感染以及归因于AMR的生活质量损失所致。已经提出了许多干预措施来控制AMR的发展并减轻其传播带来的风险。本文回顾了细菌AMR管理和控制的关键方面，这些方面可以利用人工智能，机器学习以及数学和统计建模等数据技术，这些领域在本世纪已经快速发展。尽管数据技术已成为生物医学研究的组成部分，但它们对AMR管理的影响仍然很小。我们概述了使用数据技术来打击AMR，详细介绍了四个互补类别的最新进展：监视，预防，诊断和治疗。我们在生物医学研究，临床实践和“一个健康”背景下使用数据技术提供了有关当前AMR控制方法的概述。我们讨论了数据技术的潜在影响和挑战在高收入和中等收入国家中面临的实施，并建议将这些技术更容易地整合到医疗保健和公共卫生中所需的具体行动，并建议使用具体的行动部门。

translated by 谷歌翻译

Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models

Virginia K. Felkner , Ho-Chun Herbert Chang , Eugene Jang , Jonathan May

分类：自然语言处理

2022-06-23

本文介绍了探索性的工作，介绍了以及在何种程度上对酷儿和跨性别者的偏见是用大型语言模型（LLM）（例如伯特）编码的。我们还提出了一种减少下游任务中这些偏见的方法：对由和/或关于酷儿人编写的数据进行填充。为了衡量抗Quase偏见，我们引入了一个新的基准数据集Winoqueer，以其他偏置检测基准测试，但要解决同性恐惧和跨性别偏见。我们发现伯特表现出明显的同性恋偏见，但是这种偏见可以通过finetuning bert对LGBTQ+社区成员撰写的自然语言语料库进行缓解。

translated by 谷歌翻译

An Empirical Study of Quantum Dynamics as a Ground State Problem with Neural Quantum States

Vladimir Vargas-Calderón , Herbert Vinck-Posada , Fabio A. González

分类：机器学习

2022-06-18

神经量子状态是通过人工神经网络参数化的变异波函数，这是一种数学模型，在机器学习社区中数十年。在多体物理学的背景下，诸如具有神经量子状态的变异蒙特卡洛作为变异波函数之类的方法在近似精确的近似性方面是成功的，即量子哈密顿量的基础。但是，提出神经网络体系结构的所有困难，以及探索其表现力和训练性，都渗透到其作为神经量子状态的应用。在本文中，我们考虑了Feynman-Kitaev Hamiltonian的横向场模型，该模型的基态编码在离散时间步骤下旋转链的时间演变。我们展示了该基础状态问题如何特别挑战神经量子状态的训练性，因为时间步骤的增加，因为真实的基态变得更加纠缠，并且概率分布开始遍及希尔伯特空间。我们的结果表明，所考虑的神经量子状态能够准确地近似系统的真实基态，即它们具有足够的表现。然而，广泛的超参数调整实验表明，经验事实是，在变化的蒙特卡洛设置中，训练性较差 - 可以防止对真实基态的忠实近似。

translated by 谷歌翻译

Predicting User Code-Switching Level from Sociological and Psychological Profiles

Injy Hamed , Alia El Bolock , Nader Rizk , Cornelia Herbert , Slim Abdennadher , Ngoc Thang Vu

分类：自然语言处理

2021-12-13

多种语言的扬声器倾向于在对话中的语言之间交替，该现象称为“代码切换”（CS）。CS是一种复杂的现象，不仅包括语言挑战，而且在讲话者的动态行为方面也包含大量复杂性。社会学家和心理学家研究了这种动态行为，确定了影响CS的因素。在本文中，我们对阿拉伯语 - 英语CS提供了实证用户研究，在那里我们展示了用户CS频率和字符特征之间的相关性。我们使用机器学习（ML）来验证调查结果，通知和确认现有理论。预测模型能够预测用户的CS频率，精度高于55％，其中旅行经验和人格特征在建模过程中起最大的作用。

translated by 谷歌翻译